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Abstract 

Consider a bin containing n balls colored with two colors. In a 
/c-query, k balls are selected by a questioner and the oracle's reply is 
related (depending on the computation model being considered) to the 
distribution of colors of the balls in this /c-tuple; however, the oracle 
never reveals the colors of the individual balls. Following a number of 
queries the questioner is said to determine the majority color if it can 
output a ball of the majority color if it exists, and can prove that there 
is no majority if it does not exist. We investigate two computation 
models (depending on the type of replies being allowed). We give 
algorithms to compute the minimum number of 3-queries which are 
needed so that the questioner can determine the majority color and 
provide tight and almost tight upper and lower bounds on the number 
of queries needed in each case. 
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1 Introduction 



We are given a bin containing n balls colored with two colors, e.g., EK red 
and blue. At any stage, we can choose any k of the balls and ask the question 
"do these balls have the same color?". The aim is to produce a ball of the 
majority color (meaning that the number of balls with that color is strictly 
greater than that of the other color), or to state that there is no majority 
(i.e. there is the same number of red and blue balls) using as few questions 
as possible. We are considering the worst case problem, that is our aim is to 
find the number of questions the fastest algorithm uses in the worst case. 

1.1 Model of computation 

In computing the majority there are two participants: a questioner (denoted 
by Q) and an oracle (or adversary) denoted by A (see Figured]). The ques- 
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Figure 1: A questioner (Q), an oracle (or adversary) (A) and a /c-query 
input. 

tioner asks questions on the status of the color of the balls by providing 
the oracle with k balls and the oracle provides a reply which, depending on 
the type of the oracle, is related to the distribution of colors of the balls 
concerned. 

Although the answer of the oracle depends on the /c-tuple provided by 
the questioner, it is otherwise permutation-independent, i.e., the answer is 
independent of the order of the balls. Moreover, the balls are endowed with 
distinct identities (say, integers 1,2, ... ,n). Notice that in the course of 
querying the questioner is never allowed to see the color of any ball but 
merely relies on the answer provided by the oracle. 

Majority Problem. Given n balls of two colors, for a solution to the 
majority problem we are interested to determine 
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1. whether or not there is a color occurring in the majority (i.e., more 
than half) of the balls, and 

2. if indeed there is majority, to output a ball having the majority color. 
In addition, we would like to minimize the number of queries required. 

Query Models. Next we define two query models which will be considered 
in the sequel. In each model the input to the oracle is a set {xo, x±, . . . , Xk~i\ 
of k balls. 

1. Y/N Model: The answer is either yes or NO. yes means that all 
balls have the same color, NO means that the balls in the fc-tuple do 
not have the same color. 

2. Pairing Model: The answer to a query is either yes or NO. yes means 
that all balls have the same color. NO means that not all the balls have 
the same color and to show this, (any) two balls of different color are 
also provided. 

1.2 Notation 

Let ql{n), qk{n) denote the minimum number of queries which are sufficient to 
solve the majority problem for n balls colored with two colors using /c-queries 
in the Y/N and Pairing models, respectively. It is obvious that ql(n) < qk{n) 
(assuming that these numbers exist). 

1.3 Related work 

Our study is a natural generalization of the well-known majority problem [1] 
where at any stage two balls a and b are chosen and the question is "do a 
and b have the same color?". It is obvious that q2{n) = ?f( n ) an d Saks and 
Werman [15] showed that q2{n) = n — u(n), where z/(n) denotes the number 
of l's in the binary representation of n. Later Alonso, Reingold, and Schott 
[6] also gave the solution for the average case. Aigner [21 13] introduced several 
variants and generalizations of the majority problem. In particular, in the 
(n, fc)-majority game one has to show a fc-majority ball z (that is, there are 
at least k balls colored like z), or declare there is no fc-majority. 
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Other variations of the majority problem include the case where more 
than two colors are available. Fisher and Salzberg [IT] studied the majority 
problem when the number of colors is any integer up to n. In this case the 
majority color is the color such that there are at least n/2 + 1 balls of that 
color. They solved the problem by showing that [3n/2] — 2 comparisons 
are necessary and sufficient. Another natural generalization is the plurality 
problem where the goal is to identify a ball of the dominant color (i.e., the one 
occurring more often than any other). In [I] linear upper and lower bounds 
were given for 3 colors. The currently best result for any number of colors was 
given in [8]. The authors of [8] studied also the oblivious versions both for 
the majority and the plurality problem. In oblivious strategies the questioner 
has to ask all questions in advance before getting any answer from the oracle. 
Finally, bounds for randomized algorithms can be found in [TUl [H] . 

1.4 Outline and results of the paper 

Section [2] discusses the problem of existence of a solution to the majority 
problem in the models proposed. In Section [3] we are considering the Y/N 
model for k = 3 and give lower and upper bounds for q 3 (n) whose difference 
is between 1 and 3, depending on the residue of n modulo 4. Section H] 
investigates the Pairing model. Here we give a general lower bound for q^(n) 
and compute the precise value of q^{n), namely 



2 Existence of Solutions 

Before trying to compute q^.(n) and qk{n) we should discuss whether these 
numbers exist at all, since it may happen that asking all possible queries is 
not enough for Q to solve the problem. It is clear that n should be at least k 
and if n = k, then the only possible query is enough in both games for k = 2 
and in the Y/N model for k = 3, but not in the other cases. For k > 3 we 
prove that quin) and q^iji) exist if and only if n > 2k — 2 and n > 2k — 3, 
respectively. 

Theorem 2.1 Let k>3. 

1. The number qk{n) exists if and only if n > 2k — 2. 
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2. The number q^(n) exists if and only if n > 2k — 3. 
The proof can be found in the Appendix. 

3 Y/N Model 

It might be worth observing that in the Y/N model while a NO answer on a 
pair of balls tells us that these balls have different colors and consequently 
they can be discarded by the questioner, when the number of balls is greater 
than two then a NO answer only tells us that there are (at least) two balls 
of different color. Therefore, if on the one hand it seems more advantageous 
comparing more balls at a time, on the other hand it is more challenging for 
the questioner to exploit a less informative NO answer. 

First we give upper bounds and next we discuss lower bounds on the 
number of queries. We conclude with some examples. 

3.1 The upper bound 

In this section we give an algorithm for solving the majority problem. We 
start with a straightforward extension of a result of Saks and Werman. 

Lemma 3.1 For all n odd we have qk{n) < qk(n — 1). 

The proof is basically the same as the proof of Saks and Werman for k = 2; 
we include it in the Appendix for completeness. 
Let us now describe our algorithm. 

Algorithm MAJORITY3. Let m = [n/4\. Consider an arbitrary partition 
of the n balls in m groups G\, G2, ■ ■ ■ , G m of size 4 each and let R be the 
group of the r = n mod 4 remaining balls. For i — 1, 2, . . . , m, we make all 
( 3 ) = 4 possible triple queries on the four balls of Gi till we get a yes answer. 
There are two cases: 

• (a) for some j we get a yes answer on at least one of the 4 triple queries 
involving balls of Gj] 

• (b) we get always NO answers on all the 4m comparisons. 
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In case (a), we can discard all balls contained in G±, . . . , Gj_i as they 
have no effect for determining the majority. Let a, b, and c the three balls in 
Gj that have the same color. Notice that we also know whether or not the 
color of the fourth ball d in Gj is the same as the color of the other three 
balls. The number of queries required up to this point is 4j. From now on, 
all the remaining n — 4j balls are compared, one at a time, with two balls 
of identical color, e.g. a and b. It is clear that this way we can count the 
number of balls colored like a and b and the number of balls colored like a 
different ball e, if it exists. This will allow us to solve the problem by using 
a total number of at most 4j + (n — 4j) = n queries. 

In case (b), we have found that there is an equal number of red and blue 
balls among the 4m balls in G\ U G^ U • • • U G m . Therefore, in this case, the 
majority is determined by the remaining r balls in set R. Hence, we have 
to determine, for every r G {0,1,2,3}, the number g 3 (4m + r). We limit 
our analysis to g 3 (4m) and g 3 (4m + 2) as, in view of Lemma 13.11 we have 
g 3 (4m + 1) < q 3 (4m) and (fc^m + 3) < q^m + 2). If r = 0, the algorithm 
can already state that there is no majority. If r = 2, the problem reduces to 
ascertain whether these two remaining balls have the same color (in which 
case any of them is in the majority) or not (which implies that there is no 
majority). In order to do so, we compare the 2 remaining balls (call them 
x and y) with three arbitrary balls a,b,c (one at a time) from G\. Namely, 
we perform the following queries: {x,y,a}, {x,y,b} and {x,y,c}. If x and 
y are identically colored, we obtain a yes (recall that, since we are in case 
(b), there must be two different balls among a, b, and c); otherwise we always 
obtain NO answers. Therefore, in both cases we can solve the ptoblem using 
at most 3 additional queries. 

In conclusion, we have proved the following. 

Lemma 3.2 Let n = 4m + r for some r G {0, 1, 2, 3} and m > 1. 

g3(4m) < 4m = n; 
g 3 (4m + 1) < 4m — n — 1; 
g 3 (4m + 2) < 4m + 3 = n + 1; 
g 3 (4m + 3) < 4m + 3 = n. 
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The next lemma will be useful in the proof of the lower bound. 

Lemma 3.3 With at most one additional comparison, algorithm Majority 3 
is able to output a number i such that there are i balls of one color and n — i 
of the other color. 

The proof of Lemma 13.31 can be found in the Appendix. 
3.2 The lower bound 

In the sequel, a coloring is a partition (R, B) of the set of balls into two sets 
R and B, where R is the set of red balls and B the set of blue balls. 

In this section we will give a lower bound on the number of queries needed 
to determine the majority. Our aim will be to construct a worst case input 
coloring for any unknown correct algorithm that solves the majority problem. 
We use the usual adversary lower bound technique. 

We say that an adversary's strategy admits a coloring if such a coloring 
is consistent with all the answers provided by the adversary. As long as the 
strategy devised by the adversary admits alternative possible colorings that 
are consistent with at least two different possible solutions for the majority 
problem, the algorithm cannot correctly give its output. The goal of the 
adversary is to maximize, with its strategy of answers, the number of rounds 
until the algorithm can correctly give its output. 

We will first consider the case of an even number n of balls. Given a 
sequence of queries S = (Qi, ■ ■ ■ ,Qt) and a positive integer i < t, let us 
define the following property: 

: for all AC [n], with |A| = m, there exists j < i such that one of 
the following conditions hold: 

(a) Qj C A; 

(b) Qj (IX = 0. 

The adversary's strategy. The strategy followed by the adversary 
is defined as follows. To every query Qj, the adversary replies NO as far 
as does not hold. Let i be the first index for which holds, if it 

exists. Then there exists a set X C [n], with \X\ = m, such that Qj $2 A and 
Qj <2 X = [n] — X for j = 1, 2, . . . , i — 1 (this is because i is the smallest index 
such that holds), and either Qi C A or Qi C X (these are conditions 

(a) and (b) on £P(i)). From this point forth, the adversary replies yes to Qi 
and all subsequent queries {a, b, c} if all three elements a, b, c belong to the 
same set A or A, and replies NO to all the other queries. 
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Lemma 3.4 Let n = 2m for some m > 1. Any algorithm that solves the 
majority problem on n balls uses at least n — 1 queries. 

Proof: Assume that Q%, . . . , Qt is the sequence of queries that the algorithm 
produces before giving its output. We distinguish two main cases: either 
&>(t) does not hold or @>{t) holds. 

Case 1: &(t) does not hold. In this case there exists X C [n], with 
\X\ = m, such that for all queries Qi,...,Q t neither (a), nor (&) holds. 
According to its strategy, the adversary replies always NO in this case. 

If the algorithm states that there is a majority, the algorithm actually 
has to show a ball in X or in [n] \ X. The adversary's strategy admits the 
following coloring: color red all the balls in X and blue all the remaining 
balls. Indeed, such a coloring is consistent with all the NO answers provided 
by the adversary, as since property ^(t) does not hold, there is no query 
entirely contained in X or in X. This means that there is no majority, which 
contradicts the algorithm. 

Let us assume the algorithm states that there is no majority. Suppose that 
there exists a ball x in X such that Qi <2 X U {x} for every i — 1, 2, . . . , t. In 
this case, the adversary's strategy allows us to produce the following coloring: 
color red all the balls in X U {x} and blue the others. Indeed, it is easy to 
observe that, since Qi % X U {x} for every i = 1,2, ... ,t, such a coloring 
is consistent with the NO answers provided by the adversary to each Qi for 
i = 1,2, ... ,t. This implies that there is a majority: once again the claim 
of the algorithm is contradicted. The case in which there exists a ball x in 
X such that Qi (£ X U {x} for every i — 1, 2, . . . , t, is similar. It remains to 
analyze the case when both the following conditions hold: 

(1) for every x G X there exists a query Qi for some i G {1, 2, . . . , t}, such 
that ftCJU {x}; 

(2) for every x G X there exists a query Qi for some i G {1, 2, . . . , t}, such 
that Q l CXU{x}. 

Condition (1) implies that the sequence of queries Qi,Q2, ■ ■ ■ ,Qt must 
contain a query {x, x±, X2} for every x G X and for some Xi, £2 G X. Analo- 
gously, condition (2) implies that there must be a query {x', x[, x' 2 } for every 
x' G X and for some x' Xl x ! 2 G X. Since all these queries are clearly distinct, 
t > 2m = n. 

Case 2: £P(t) holds. In this case, the adversary replies NO to all the 
queries Qj, as far as does not hold, and replies yes to the first query 

Qi for which holds. Since holds, while for j = 1, 2, . . . , i — 1 
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does not hold, there exists X C [n], \X\ = m, such that either Qi C X or 
Qi ^ X, and 

^ X and Qj % X for j = 1, 2, . . . , i - 1. (1) 

In the following, we assume without loss of generality that Qi C X. 
Notice that this time the adversary's strategy admits any coloring consistent 
with all the NO answers given to Qi, Q2, ■ ■ ■ , Qi-i an d the yes answer to Qi. 

If the algorithm concludes that there is a majority, it can be easily con- 
tradicted by observing that the adversary's strategy admits a coloring that 
colors red all the balls in X and blue all the others. 

Therefore, suppose the algorithm states that there is no majority. If there 
exists a ball x G X \ Qi (resp. y G X) such that for every j = 1,2, ... ,i, 
Qj $Z X U {x} (resp. Qj <2 X U {y}), the adversary's strategy admits the 
following coloring. Color red all the balls in X \ {x} (resp. X \ {y}) and blue 
the balls in X U {x} (resp. X U {y}), so to have a majority in the latter set. 
This contradicts the algorithm's claim. 

It remains to analyze only the case when both the following conditions 
hold: 

(1*) for every x G X\Qi there is a query Qj for some j G {1,2, ... ,i— 1}, 
such that Qj C X U {x}; 

(2*) for every y G X there is a query Qj for some j G {1,2, . . . ,i — 1}, 
such that QjCIU {y}. 

Let J-i and be the set of queries necessary for (1*) and (2*) to be 
satisfied, respectively. Condition (1*) implies that the sequence of queries 
Qi, Q2, ■ ■ ■ , Qi-i must contain at least a query {x, y, z) for every x G X \ Qi 
and for some y, z G X . Condition (2*) implies that there must be also at 
least a query {x', y', z'} for every x' G X and for some y', z' G X. Therefore, 

+ |J* 2 | > 2m - 3. (2) 

In order to estimate the total number of queries, we need also to consider the 
set J-3 of queries that involve balls in Qi = {a,b,c} and are not considered 
in T\ and J-" 2 - The query Q t clearly belongs to J-3, so 

\K\ > 1- (3) 

Now we have to distinguish two cases. 

Case (i): there is no query Q' = {x, y, z}, such that x G X and y, z G Qi. 
In this case, JF% must contain at least another query Q that contains some 
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element from {a, b, c} and two elements from X, otherwise the adversary's 
strategy would admit a coloring where all the balls in X U {a, b, c} are red, 
which would imply the existence of a majority in this set. Hence, > 2 
and by (J2J) the total number of queries is + jj^l + I ^3 1 > 2m. — 1. 

Case (ii): there exists a query Q' = {x, y, z} such that x G X and 
y,z e Qi. Let s be the number of queries involving one ball from Qi and two 
balls from X. We may assume that s < 2, otherwise IJ-3I > s + 1 > 2 and 
the proof would be complete. If s < 2, we must have that 

(*) there exist two balls a and b in Qi such that there is no query involving 
a or b with two balls from X. 

Let us consider x,y,z G Q' and recall that x G X and y, z G Qi- By (*) we 
have that 

(**) there exists a ball v G {y, z} C Qj such that there is no query 
{v, W\, W2} with wi, W2 G X. 

Moreover, since we are assuming that + jj^l = 2m — 3, condition (2*) 
implies that 

(***) for every x G X there is exactly one query Q x = {x, y x , z x } that 
includes x and some y x ,z x G X. 

Let Y = X \ {v} U {x}. We will show that there is no query among 
Q11Q2, ■ ■ ■ ,Qi that is entirely contained in Y or Y, which contradicts the 
hypothesis that &(t) holds with i being the smallest index such that for all 
X C [n], with \X\ = m, there exists j < % such that either Qj C X or 

c X. 

Since now x G y and w G Y, we can easily verify that Qi % Y and 
Qi ^ Y. It remains to show that no query among Qi, Q2, ■ ■ ■ , Qi-i is entirely 
contained in Y or Y. 

As a consequence of (**), there is no query involving v entirely contained 
in Y. Moreover, by (***) Q 1 must be the only query involving x G X and 
two balls in X. Hence, there is no query involving x entirely contained in 
Y. Let Q be the set of remaining queries to analyze, i.e. the queries not 
involving v nor x. By the definition of Y and Q, it is straightforward to 
observe that for all Q G Q, Q C Y (resp. Q C F) if and only if Q C X (resp. 
Q C X). Therefore, since % is the smallest index such that holds, there 
is no query in Q entirely included in Y or in Y. This is a contradiction and 
the proof is now complete. ■ 

Lemma 3.5 For all n odd we have q^{n) > n — 3. 
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The proof is based on Lemmas 13 . 3 1 and 13 . 4 1 and can be found in the Appendix. 
Now we can state the main result in the following theorem. 

Theorem 3.1 Let n = 4m + r for some r G {0, 1,2,3} and m > 1. We 
have: 

n — 1 < g 3 (4m) < n; 

n — 3 < gs(4m + 1) < n — 1; 

n - 1 < g 3 (4m + 2) < n + 1; 

n — 3 < g3 (4m + 3) < n. 

Proof: The proof follows immediately by combining Lemma 13.21 with Lem- 
mas 13.41 and 13.51 ■ Tightening the bounds of Theorem 13.11 so as to compute 
the exact value of q% (n) does not seem to be easy, but we provide the exact 
values of qzin) for n < 6 in the appendix. 

4 Pairing Model 

In this section we focus on the Pairing model. First we prove a lower bound 
on /c-tuples that does not depend on k and then show that the bound is tight 
for k = 3. 

4.1 A general lower bound 

First we prove a lemma. 

Lemma 4.1 Let n > 2k — 3, k > 3. Then 



Proof: First we describe a strategy for the Adversary which guarantees that 
the number of queries the questioner should use is at least [n/2\ if n is odd. 
The strategy is quite simple, A always answers NO whenever it is possible, 
that is, if it is not known before the query that all k balls asked have the 
same color (in which case the answer is yes, of course, but such a question 
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will never be asked). A also has to show two balls of different color; these 
can be any pair of balls that may have different color before the query. 

To see that Q has to use at least [n/2\ queries even against this simple 
strategy, we use graphs to describe the knowledge of Q (this is the same 
graph we use in the proof of Theorem 12. ip . In ths Pairing model to every 
negative answer of A there corresponds a pair of balls having different colors. 
Let these pairs be the edges of a graph G, whose vertices are the balls. 

By the strategy of A there exists a coloring of the balls (with colors blue 
and red) such that there is no edge between two balls of the same color; 
therefore the graph G is bipartite. Suppose now that Q can show a ball of 
majority color. This is possible if and only if there is a vertex x of G that 
always appears in the greater part in every bipartition of G. (Notice that 
now there is a majority color, since n is odd.) Now it is easy to see that G 
has at least \n/2\ edges (for completeness this appears in the Appendix as 
Lemma [5. II) . thus the number of queries is also at least |_rz/2j . 

Now we slightly modify the above strategy to obtain the lower bound 
for n even. Q can declare that there is no majority if and only if in every 
bipartition of G the two parts have the same size, which is impossible if there 
is an isolated vertex in G. 

Suppose that A answers the first n/2 — 1 questions the same way as above. 
We consider two cases. 

Case 1. The edges of G after the n/2 — 1 queries are not independent. Now 
the answer to the next query is also answered the same way as above. Since 
there is an isolated vertex in G (we have n/2 edges that are not independent), 
Q cannot declare that there is no majority. Moreover, no vertex x can always 
appear in the greater part in every bipartition of G, because there are only 
n/2 edges in G (in the part where such an x would appear, there would be 
at least n/2 + 1 non-isolated vertices). This completes the proof in Case 1. 

Case 2. The edges of G after the n/2 — 1 queries are independent. Now A 
has to be careful, since an edge between the two remaining isolated vertices 
would guarantee that there is no majority. So the edge is drawn somewhere 
else, which is possible, because k > 3 and there is no cycle in G (that is, A 
may draw the edge between any two of the k vertices of the query). Now the 
situation is the same as in Case 1. This completes the proof. ■ 
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4.2 Determining q^{n) 

Now we prove that the bound we have just proved in Lemma 14.11 is tight 
for k — 3. This is interesting because of several reasons. The strategy 
of A that gave the lower bound is quite simple and the lower bound is 
much smaller than the value of q2{n) (as well as 93(71)). However, the most 
surprising is that the function is not increasing, since (jf (2n) — n + 1 and 
q^{2n + 1) = n. Actually, we have seen that q^ is not strictly increasing and 
the same proof shows that the same is true for q v k : q\(2n + 1) < ql(2n) and 
qk(2n + 1) < gfe(2n). Nevertheless, a complexity function that is not even 
increasing is quite unusual in search theory. 

The main theorem of this section is the following. 

Theorem 4.1 Let n > 3. Then 



Proof: We just have to show that the bounds in Lemma [4.11 are tight, 
that is, give an algorithm for the questioner Q that uses n/2 + 1 queries if n 
is even and \n/2 J queries if n is odd. 

The algorithm consists of two main parts of which the first is the formation 
of bins with balls and is similar to the one in [15]. Initially place all the n 
balls in n different bins and at each step do the following. 

1. Select any 3 equal size bins and pick one representative from each of 
them and query these 3 balls. 

2. If the answer is yes, then merge the bins into one new bin. 

3. If the answer is NO, then A specifies two balls of different colors. Re- 
move the two corresponding bins. 

4. Iterate the process until there are no 3 bins of equal size. 

It is obvious that all remaining bins have size a power of 3. The process 
stops because we can no longer find 3 bins of equal size, that is, each bin size 
occurs at most twice. It is easy to check that to build a bin of size 3 C the 
questioner Q needs (3 C — l)/2 queries, while to remove two bins of size 3 C 
each, the questioner Q needs exactly 3 C queries. Thus if the total number of 
removed balls is 2t, then Q used t queries in order to get rid of them. 
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This means that after the first main step of the algorithm, Q has n — 2t 
balls in bins whose sizes are powers of 3, bin sizes occur at most twice and 
Q used t queries to remove the 2t balls and (3 C — l)/2 queries to build a 
bin of size 3 C . In other words, if the bin size 3* occurs aj times ii = 0, 1, 2) 
then the sequence a s , a s _i, . . . , ao is the ternary expansion of n — 2t (where 
s = |~log 3 (ri — 2t)1) and Q used t + Yli=o a *(3* ~~ queries altogether. 

It is obvious that if a s = (which occurs iff n — 2t — 0), then there is no 
majority, if a s = 1, then the corresponding bin contains majority type balls, 
while if a s = 2, then Q cannot show a majority type ball, neither can prove 
that there is no majority at the moment, so we may suppose that a s = 2 
(otherwise we are done, since 2t + Yli=o a «(3 l — 1) < n , so for the number of 
queries we have that t + Z~2i=o ai ^ 1 — -0/2) < n /2)- 

Now starts the second part of the algorithm, where Q tries to eliminate 
the remaining bins. If s > 0, then Q chooses one ball from both bins of size 
3 s and a third ball from one of these bins and queries this triple. Given the 
answer, Q learns if the two biggest bins have balls of the same color or not. If 
the answer is yes, then all balls in these two bins are clearly in majority, if the 
answer is NO, then Q removes the two bins. Iterate this process if necessary 
(if the greatest bin size occurs twice), until it is possible, that is, while s > 0. 
To remove two bins of size 3 C each, Q still needs exactly 3 C queries, therefore 
it is clear that at the end of the second part Q can either solve the problem 
(that is, find a majority ball or show that there is no majority) using at most 
n/2 queries or Q has just two bins of size 1 and used exactly (n — 2)/2 queries 
to reach this position (which is possible only if n is even). Now to finish the 
algorithm and the whole proof let the two remaining balls be a and b and let 
c and d be two balls of different color (such balls exist and are known to Q 
, since n > 3 and n is even). Now Q queries the balls a, b, c. If the answer is 
YESor the answer is NOand the two differently colored balls shown by A is 
a and b, then Q can solve the problem using n/2 queries altogether. If the 
answer is NOand the two different balls shown by A are not a and b, then 
Q queries the balls a,b,d, thus solving the problem using n/2 + 1 queries 
altogether. This completes the proof of Theorem 14.11 ■ 

5 Conclusion 

In this paper we studied the minimum number of triple queries needed to 
determine the majority color in a set of n colored balls (colored with two 
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colors) under two models a) Y/N, and b) Pairing, which depend on the 
type of queries being allowed. In addition to tightening the bounds for the 
majority problem in the Y/N model, several interesting questions remain 
open for further investigation, including computing majority on a) /c-tuple 
queries, for some fixed k, b) bins with balls colored with more than two 
colors, as well as c) other natural computation models. 
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Appendix 



Proof of Theorem 12.11 

Proof: Consider /c-tuple inputs. We have to show that Q can solve the 
majority problem by asking all possible queries if and only if n > 2k — 2 in 
the Y/N model and if and only if n > 2k — 3 in the Pairing model. It is 
easy to see for both problems that if A gives a positive answer (i.e., declares 
that balls in a set S of size k have the same color), then Q can learn which 
balls have the same color as the balls in S, thus solving the problem. The 
questioner Q just has to ask every query containing the balls of S', where S' 
is an arbitrary subset of S having size k — 1. 

Thus we may assume that A never gives a positive answer. Now the 
first part of the theorem is easy to prove. If n > 2k — 1, then A must give 
a positive answer some time, since there are k balls having the same color, 
while if n = 2k — 2 and there are no k balls of the same color, then there 
cannot be majority (both colors appear k — 1 times). On the other hand, 
if n < 2k — 3 then it is possible that there are exactly k — 1 red balls and 
n — k + 1 < k — 2 blue balls, in which case the answer is always negative and 
the problem cannot be solved, since there is a majority but no ball can be 
named as one in majority. 

Since in the Pairing model Q is given more information, it is obvious that 
it can be solved if n > 2k — 2. To show that in the case n = 2k — 3, Q can 
also solve the problem, but in the cases n < 2k — 4 cannot, we use graphs to 
describe the knowledge of Q . In this approach to every negative answer of 
A there corresponds a pair of balls having different colors. Let these pairs 
be the edges of a graph G, whose vertices are the balls. 

It is straightforward that there always exists a coloring of the balls (with 
colors blue and red) such that there is no edge between two balls of the same 
color; therefore the previously defined graph G is bipartite. It is also obvious 
that Q can show a ball of majority color if and only if there is a vertex x of 
G that always appears in the greater part in every bipartition of G and Q 
can declare that there is no majority if and only if in every bipartition of G 
the two parts have the same size. 

Let us consider now the case n < 2k — 4. Let G be a graph on n < 2k — 4 
vertices with an edge set consisting of [n/2\ — 1 independent edges. It is 
easy to see that any set of k vertices spans at least one edge, so the answers 
of A are all negative and to every answer A can show a pair of balls that 
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are neighbours in G. 

Assume now to the contrary that Q can solve the problem, that is in 
every bipartition of G the two parts have the same size or there exists a 
vertex x of G that always appears in the greater part in every bipartition of 
G. 

The first case is clearly impossible since there exists an isolated vertex in 
G. For the second case we prove a useful lemma. 

Lemma 5.1 Let G be a bipartite graph on n vertices. If there exists a vertex 
x of G that always appears in the greater part in every bipartition ofG, then 
G has at least \n/2\ edges. 

Proof: ( of Lemma \5J\) . Let C be the part of a bipartition of G that contains 
x. It is easy to see that C\ {x} contains at least \n/2\ non-isolated vertices, 
otherwise moving the isolated vertices of C \ {x} to the other part of the 
bipartition would leave x in the smaller part, which is impossible. Notice 
that x itself may be an isolated vertex. Since there is no edge between two 
vertices of C, there must be at least \n/2\ edges in G. This completes the 
proof of Lemma 15.11 ■ 

By Lemma ISTTl the second case also leads to a contradiction, which shows 
that for n < 2k — 4 the questioner Q cannot solve the problem. 

Now let us turn our attention to the only remaining case, n = 2k — 3. 
Since A must always answer NO, every set of k vertices of G spans at least 
one edge, in other words a(G), the maximum size of an independent vertex 
set of G is at most k — 1. It is well-known that a(G) + t(G) = n, where r{G) 
is the minimum size of a vertex cover of G, thus r(G) > n — k + 1. G is a 
bipartite graph, so by the theorem of Konig [12] t(G) = ^(G), where v(G) 
is the maximum size of an independent edge set of G. Thus there exists an 
independent edge set X of size n — k + 1 = k — 2 = {n — l)/2 in G. Now 
the ball corresponding to the only vertex that is not covered by X must be 
in majority, thus finishing the proof of Theorem 12. II ■ 

Proof of Lemma 13.11 

Proof: 

Let S be the set of balls. Let us remove a ball x from S. By definition we 
can solve the majority problem on S \ {x} using qk{n — 1) queries. Now there 
are two cases to consider. If S\ {x} has no majority, then the number of blue 
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and red balls in the set S \ {x} is the same and therefore x has the majority 
color in the set S. On the other hand if S"\{x} has majority, then since n — 1 
is even the number of balls of the majority color is at least (n — l)/2 + 2 and 
therefore the majority color of S must be the same as the majority color for 
S \ {x}, proving qk(n) < qk(n — 1) for n odd. ■ 

Proof of Lemma 13.31 

Proof: In ) of the proof of Lemma 13.21 the statement of the lemma 

is straightforward even without the additional question. The same holds for 
case (b) if r = 0, 1, 2. Let us consider now r = 3. Let R = {x, y, z) be 
the set of remaining balls. The algorithm chooses any two among them and 
executes the same comparisons as for the case r = 2 in order to ascertain 
whether these two balls are identically colored or not. If they have different 
colors, the gap (difference between majority amd minority) is 1; if they have 
the same color, one additional comparison among x, y, and z will ascertain 
whether the gap is 1 or 3, finishing the proof. ■ 

Proof of Lemma 13.51 

Proof: Let us consider a set Sofm balls with m even. Take 3 arbitrary balls 
a, b, and c from S and solve the majority problem on the remaining m — 3 
balls by using q^m — 3) queries. In view of Lemma [3 .3[ using g 3 (m — 3) + 1 
questions we also know a number i, such that there are % balls of the majority 
and m — 3 — i of the minority color. It is obvious that i > (m — 4)/2. 

Let x be the majority ball provided by the algorithm on the m — 3 balls. 
Notice that x exists as m — 3 is odd. We first show that with at most 4 more 
queries we can solve the majority problem on the whole set S of m balls. 
First we perform the query {a, b, c}. 

Assume first that we got a YES answer on {a, b, c}. Now we perform query 
{a, b, x}. If we get a yes on {a, b, x} we infer that x is a majority ball for S. 
If {a, b, x} gives a NO answer, then three cases are possible: 

if i < y then either of a, b or c can be showed as majority ball; 

if % = y then there is no majority ball; 

if % > y then x is a majority ball. 
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If on the other hand we got a NO answer on {a, 6, c}, we can solve the 
majority problem by using the following three additional queries: {a,b, x}, 
{a, c, x} and {b, c, x}. If at least one of the three answers is yes then we can 
infer that x is a majority ball. If we get NO on all three queries, then we infer 
that there is no majority among a, b, c and x. Two cases are possible: 

if i = ^y-^ + 1, then there is no majority; 

if i > + 1, then x is a majority ball. 

This proves that with a total of (q 3 (m — 3) + 1) + 4 = g 3 (m — 3) + 5 queries 
we can solve the majority problem on the whole set S. Therefore, recalling 
that on m even, we know by Lemma 13.41 that q 3 {m) > m — 1, we have: 
q 3 {m — 3) + 5 > (73(771) > m — 1, which implies q 3 {m — ?>) > m — 6. The 
lemma follows by letting n = m — 3. This completes the proof of Lemma [3751 



Number of queries in the Y/N model, for n < 6 

In the sequel we compute the number of queries when the number of balls is 
small (n < 6). 

Example: ^3(4) = 4. There are (^) = 4 triples. By querying all four of 
them we can decide whether or not there is majority: if all four answers are 
NO there is no majority otherwise a yes answer on a triple yields a ball of 
majority color. 

Example: ^3(5) = 4. Observe that by Lemma [3.11 we have that #3(5) < 
g 3 (4) = 4. Assume to the contrary that #3(5) < 3 and let the balls be num- 
bered 1, 2, 3, 4, 5. We show that the majority problem cannot be solved with 
three queries. Consider the three triples Qi, Q 2 , Q3 that are being queried. If 
x ^ Qi UQ2 UQ3, for some ball x, then the three queries apply only to four of 
the five balls {1, 2, 3, 4, 5} \ {x}; therefore an "adversary" flipping the color of 
x can change the outcome which shows that the majority problem cannot be 
solved. Therefore it remains to consider the case Q\ UQ2UQ3 = {1, 2, 3, 4, 5}. 
It is easy to see that there are two triples, say Qi, Q2, such that IQifl^l = 2. 
W.l.o.g. assume Q x = {1,2,3} and Q 2 = {2,3,4}. Clearly, 5 G Q 3 (other- 
wise we are in the previous case and the majority problem cannot be solved). 
There are (essentially) three cases to consider depending on the other two 
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balls in Q 3 : 1) 2,3 G Q3, 2) 1,3 G Q3, and 3) 2,4 G Q3, Again, a simple 
"adversary argument" shows that the majority problem cannot be solved in 
any of these cases. 

Example: 93(6) = 7. Lemma [3T21 tells us that q^{Am + 2) < 4m + 3 = n + 1. 
For m — 1 it follows that 7 queries are sufficient for 6 balls. What is more 
important is the following claim. 

Claim. 6 queries are not enough for 6 balls. Suppose the contrary. Since 
not all 10 pairs of complementary triples are asked, the 3 red, 3 blue situation 
is always possible if all the answers are NO. We show that if the triples are 
not in a special setting (described later), then the 4 blue, 2 red situation is 
also possible if all the answers are NO. 

If the 4-2 situation is not possible with just NO answers, then every 4- 
tuple contains a triple we asked. In other words, the complements of our 
triples contain every possible pair of balls. So we would like to cover the 
edges of the complete graph on 6 vertices with 6 (or less) triangles. Since 
a triangle covers 2 edges of the same end-vertex (i.e. triangle 123 covers 12 
and 13) and every vertex has degree 5 in Kq (which is odd), this is possible 
if and only if three edges forming a perfect matching are double covered and 
all the others are simple covered (6 triangles have 18 edges and we have 15 
edges (to see that 5 triangle is not enough is even easier)). Let these edges 
be 12,34,56. 

Now we give an Adversary strategy: the first five questions are answered 
NO, the sixth is NO, if the above setting is not satisfied, otherwise it is yes. 
In the first case both the 3-3 and the 4-2 situations are possible. We show the 
same for the second case. Now all triples contain either 12, 34, or 56 (since we 
have six triples and these edges are double covered) , so we may suppose that 
the last triple was 456 (that is, the last question was 123). We know that 1, 2, 
and 3 are of the same color, but the other balls may all have the other color, 
since the only question (answered no) that is not consistent with this is the 
question 456 and it is easy to see that this cannot be a question, because in 
this case we have the triples 123 and 456 and all other four triples have just 
one ball in common with one of them, so there are four double covered edges, 
which is impossible. It remains to show that the 4-2 situation is also possible. 
We know that 123 is a question, let the other question whose complement 
covers 56 be 1234 \ {x}, where x is 1, 2, or 3. Let y be an element from 1, 2, 3 
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different from x. Then the answers are consistent with the situation that 
1, 2, 3, and y are blue, the others are red. This finishes the proof. 
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